Relevant Attribute Discovery in High Dimensional Data: Application to Breast Cancer Gene Expressions

نویسندگان

  • Julio J. Valdés
  • Alan J. Barton
چکیده

In many domains, the data objects are described in terms of a large number of features. The pipelined data mining approach introduced in [12] using two clustering algorithms in combination with rough sets and extended with genetic programming, is investigated with the purpose of discovering important subsets of attributes in high dimensional data. Their classification ability is described in terms of both collections of rules and analytic functions obtained by genetic programming (gene expression programming). The Leader and several k-means algorithms are used as procedures for attribute set simplification of the information systems later presented to rough sets algorithms. Visual data mining techniques including virtual reality were used for inspecting results. The data mining process is setup using high throughput distributed computing techniques. This approach was applied to Breast Cancer gene expression data and it led to subsets of genes with high discrimination power with respect to the decision classes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شناسایی ژن‌های مرتبط با بقا در سرطان کلیه با استفاده از روش مؤلفه‌های اصلی لاسو

Background: Identification of correlated genes with survival by gene expression data is an important application of microarray data. The purpose of this study is to identify correlated genes with survival of conventional renal cell carcinoma (cRCC) patients based on gene expression profiles. Methods: This study is a survival analysis with high dimensional covariates and containing 14814 gene...

متن کامل

Identification of Prognostic Genes in Her2-enriched Breast Cancer by Gene Co-Expression Net-work Analysis

Introduction: HER2-enriched subtype of breast cancer has a worse prognosis than luminal subtypes. Recently, the discovery of targeted therapies in other groups of breast cancer has increased patient survival. The aim of this study was to identify genes that affect the overall survival of this group of patients based on a systems biology approach. Methods: Gene expression data and clinical infor...

متن کامل

Investigation of p53 and p27 expressions in the N-nitroso-N-methylureainduced breast cancer in female Wistar Albino rats

Introduction: N-nitroso-N-methylurea (NMU) is a carcinogen from nitrosamines family, which has been used to induce breast cancer in rodents. This model of breast cancer is very similar to the estrogen dependent breast cancer in human. As a continuation of our recent works, in the present study, the expressions of both p53 and p27 were investigated in NMU-induced breast cancer in Wistar Albin...

متن کامل

The effects of interval aerobic training on mesenchymal biomarker gene expression, the rate of tumor volume, and cachexia in mice with breast cancer

Objective(s): It seems that regular exercise can have inhibitory effects on the progression of breast cancer. This study, therefore, aimed to investigate the influences of interval aerobic training on mesenchymal biomarker gene expression, muscle cachexia, and tumor volume changes in mice with breast cancer.Materials and Methods: Thirty-...

متن کامل

Breast cancer classification by proteomic technologies: current state of knowledge.

Breast cancer is traditionally considered as a heterogeneous disease. Molecular profiling of breast cancer by gene expression studies has provided us an important tool to discriminate a number of subtypes. These breast cancer subtypes have been shown to be associated with clinical outcome and treatment response. In order to elucidate the functional consequences of altered gene expressions relat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006